Quantitative analysis of culture using millions of digitized books.
نویسندگان
چکیده
We constructed a corpus of digitized texts containing about 4% of all books ever printed. Analysis of this corpus enables us to investigate cultural trends quantitatively. We survey the vast terrain of 'culturomics,' focusing on linguistic and cultural phenomena that were reflected in the English language between 1800 and 2000. We show how this approach can provide insights about fields as diverse as lexicography, the evolution of grammar, collective memory, the adoption of technology, the pursuit of fame, censorship, and historical epidemiology. Culturomics extends the boundaries of rigorous quantitative inquiry to a wide array of new phenomena spanning the social sciences and the humanities.
منابع مشابه
Historical Analysis of National Subjective Wellbeing Using Millions of Digitized Books
Historical Analysis of National Subjective Wellbeing Using Millions of Digitized Books We present the first attempt to construct a long-run historical measure of subjective wellbeing using language corpora derived from millions of digitized books. While existing measures of subjective wellbeing go back to at most the 1970s, we can go back at least 200 years further using our methods. We analyse...
متن کاملWhy the quantitative analysis of diachronic corpora that does not consider the temporal aspect of time-series can lead to wrong conclusions
Recently, a claim was made, on the basis of the German Google Books 1-gram corpus (Michel et al., Quantitative Analysis of Culture Using Millions of Digitized Books. Science 2010; 331: 176–82), that there was a linear relationship between six non-technical non-Nazi words and three ‘explicitly Nazi words’ in times of World War II (Caruana-Galizia. 2015. Politics and the German language: Testing ...
متن کاملThe Question of Re-Presentation In EFL Course Books: Are Learners of English Taught about New Zealand?
Increasingly intercultural dimension of communication in the 21st century has brought about challenging aims in EFL (English as a Foreign Language) pedagogy, such as ascertaining the enhancement of the learners' intercultural awareness and promoting their ability to communicate in intercultural settings. Taking the disadvantage of EFL environment in terms of intercultural input into ...
متن کاملContent analysis of 150 years of British periodicals.
Previous studies have shown that it is possible to detect macroscopic patterns of cultural change over periods of centuries by analyzing large textual time series, specifically digitized books. This method promises to empower scholars with a quantitative and data-driven tool to study culture and society, but its power has been limited by the use of data from books and simple analytics based ess...
متن کاملDiscovering the Ebb and Flow of Ideas from Text
T he rise and decline in popularity of ideas have a profound effect on human society. Tracing the ebb and flow of ideas has important implications for scientific and historical research because while newer, more accurate, or more useful ideas might be expected to consistently succeed older ones, in reality this isn’t always the case. In the 1840s, for example, when Ignaz Semmelweis d iscovered ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Science
دوره 331 6014 شماره
صفحات -
تاریخ انتشار 2011